Jupyter at Bryn Mawr College
Public notebooks: /services/public/dblank / jupyter.cs / Examples

Ebola Outbreak 2014¶

Based on Python Notebook: http://nbviewer.ipython.org/gist/arsenovic/d44166390b50f9f15df3

Original data source: https://github.com/cmrivers/ebola

This version: Doug Blank, Bryn Mawr College Computer Science, Oct 27, 2014

Data retrieval¶

%matplotlib inline
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

df = pd.DataFrame.from_csv('https://raw.githubusercontent.com/cmrivers/ebola/master/country_timeseries.csv',
                 index_col=0)
df = df.sort_index()
df = df.fillna(method='bfill')

Cases per country¶

cases_titles = [k for k in df.columns if 'deaths' in k.lower()]
df.plot(y=cases_titles)

<matplotlib.axes.AxesSubplot at 0x7fee9093c9d0>

Deaths per Country¶

death_titles = [k for k in df.columns if 'deaths' in k.lower()]
df.plot(y=death_titles)
plt.legend()

<matplotlib.legend.Legend at 0x7fee8e8a6650>

Total Deaths¶

df['total deaths'] = df[death_titles].sum(axis =1)
df.plot(y='total deaths', 
        title='Total Deaths in \n 2014 Ebola Outbreak')
plt.ylabel('Total Deaths')

<matplotlib.text.Text at 0x7fee8e7e2150>

Exponential Fit¶

Plot Log(deaths) and exponential fit¶

import seaborn as sn
df['log total deaths'] = np.log10(df['total deaths'].values)
sn.lmplot('Day','log total deaths', df)

<seaborn.axisgrid.FacetGrid at 0x7fee8e87b210>

Fit statistics¶

import statsmodels.formula.api as sm

ols = sm.OLS(df['Day'].values, df['log total deaths'].values)
ols.fit().summary()

Dep. Variable:	y	R-squared:	0.867
Model:	OLS	Adj. R-squared:	0.866
Method:	Least Squares	F-statistic:	574.1
Date:	Mon, 27 Oct 2014	Prob (F-statistic):	2.49e-40
Time:	19:32:15	Log-Likelihood:	-466.55
No. Observations:	89	AIC:	935.1
Df Residuals:	88	BIC:	937.6
Df Model:	1

	coef	std err	t	P>\|t\|	[95.0% Conf. Int.]
x1	41.7750	1.744	23.960	0.000	38.310 45.240

Omnibus:	34.203	Durbin-Watson:	0.005
Prob(Omnibus):	0.000	Jarque-Bera (JB):	5.737
Skew:	0.034	Prob(JB):	0.0568
Kurtosis:	1.758	Cond. No.	1.00

Jupyter at Bryn Mawr College

Ebola Outbreak 2014¶

Data retrieval¶

Cases per country¶

Deaths per Country¶

Total Deaths¶

Exponential Fit¶

Plot Log(deaths) and exponential fit¶

Fit statistics¶